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Abstract. We consider the discretization in time of a system of parabolic stochastic partial differ- 
ential equations with slow and fast components; the fast equation is driven by an additive space-time 
white noise. The numerical method is inspired by the Averaging Principle satisfied by this system, 
and fits to the framework of Heterogeneous Multiscale Methods. 

The slow and the fast components are approximated with two coupled numerical semi-implicit 
Euler schemes depending on two different timestep sizes. 

We derive bounds of the approximation error on the slow component in the strong sense - 
approximation of trajectories - and in the weak sense - approximation of the laws. The estimates 
generalize the results of |12| in the case of infinite dimensional processes. 

1. Introduction 

In this paper, we are interested in the numerical approximation of a randomly-perturbed system 
of reaction-diffusion equations that can be written 



for t > 0, ^ S (0, 1), with initial conditions x^{0, ^) = x(S^) and y^(0, S,) = ^, and homogeneous Dirich- 
let boundary conditions x^(t,0) = x^{t,l) = 0,y^{t,0) = y^{t,l) = 0. The stochastic perturbation 

^^Qi^'^ is a space-time white noise and e > is a small parameter. 

In the recent article [3], we have proved that an Averaging Principle holds for such a system, and 
we have exhibited an order of convergence - with respect to e - in a strong and in a weak sense: 
the slow component x"^ is approximated thanks to the solution of an averaged equation. In this 
article, we analyse a numerical method of time discretization which reproduces this averaging effect 
at the discrete time level. More precisely, our aim is to build a numerical approximation of the slow 
component x*^, taking care of the stiffness induced by the time scale separation. The Heterogeneous 
Multiscale Method - HMM - procedure can be used, as it is done in \12\ for SDKs of the same 
kind. First we recall the general principle of such a method, which has been developped in various 
contexts, both deterministic and stochastic - see the review article and the references therein, 
as well as [9], [TO], [T5]. 

In system (II. ip . the two components evolve at different time scales; x*^ is the slow component of 
the problem, while the fast component y'' has fast variations in time. We are indeed interested in 
evaluating the slow component, which can be thought as the mathematical model for a phenomenon 
appearing at the natural time scale of the experiment, whereas the fast component can often be 
interpreted as data for the slow component, taking care of effects at a faster time scale. Instead 
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of using a direct numerical method, which might require a very small time step size because of 
the fast component, we use a different solver for each time scale: a macrosolver and a microsolver. 
The macrosolver leads to the approximation of the slow component; it takes into account data 
from the evolution at the fast time scale. The microsolver is then a procedure for estimating the 
unknown necessary data, using the evolution at the microtime scale, which also depends on the 
evolution at the macrotime scale. We emphasize a difference with the framework of |12j : instead of 
analysing various numerical schemes, the infinite dimensional setting implies that we only focus on 
a semi-implicit Euler scheme. 

In Section 14. ![ we state the two main Theorems of this article: we show a strong convergence 
result - Theorem 14.11 - as well as a weak convergence result - Theorem 14.21 - which are similar 
to the available results for SDEs. Compared to p2], we propose modified and simplified proofs 
leading to apparently weaker error estimates; we made this choice for various reasons. First, even if 
apparently we get weaker estimates, under an appropriate choice of the parameters the cost of the 
method remains of the same order. Second, the generalization of the final dimensional results would 
not yield the same bounds, due to the regularity assumptions we make on the nonlinear coefficients 
of the equations. Finally, we can extend the weak convergence result to the situation where the fast 
equation only satisfies a weak dissipativity assumption. 

In the case of a linear fast equation - when g is equal to - it is well-known that the second 
equation in (II. ip is dissipative. In the general case, we make assumptions on g so that this property is 
preserved for y*^ - see Assumptions 13. 8] and 13.91 below. The fast equation with frozen slow component 
- defined by (jS.ip in the abstract framework - then admits a unique invariant probability measure, 
which is ergodic and strongly mixing - with exponential convergence to equilibrium. Under the 
strict dissipativity condition 1 3. 8 1 we can prove that the averaging principle holds in the strong and 
in the weak sense; moreover the "fast" numerical scheme has the same asymptotic behaviour as the 
continuous time equation. If we only assume weak dissipativity of Assumption 13.91 the averaging 
principle only holds in the weak sense, and we can not prove uniqueness of the invariant law of the 
fast numerical scheme. Nevertheless, in the general setting [2] gives an approximation result of the 
invariant law of the continuous time equation with the numerical method which is used to prove 
Theorem 14. 2} the order of convergence is 1 /2 with respect to the timestep size - the precise result 
is recalled in Theorem 15.111 

The paper is organized as follows. In Section [21 we give the definition of the numerical scheme. 
We then state the main assumptions made on the system of equations. In Section 14.11 we state 
the two main Theorems proved in this article, while in Section 14.21 we compare the efficiency of 
the HMM scheme with a direct one in order to justify the use of a new method. Before proving 
the theorems, we give some useful results on the numerical schemes. Finally the last two sections 
contain the proof of the strong and weak convergence theorems. 



Instead of working directly with system (jl.ip . we work with abstract stochastic evolution equa- 
tions in Hilbert spaces H: 



with initial conditions X^{0) = x e H , Y^{0) = y £ H . 

To get system (j2.ip from (II. ip . we take H = L'^{0, 1); the linear operators A and B are Laplace 
operators with homogeneous Dirichlet boundary conditions - see Example l3.2l - and the nonlinearities 
F and G are Nemytskii operators - see Example 13. lUI The process (VF(t))t>o is a cylindrical Wiener 
process on H - see Section [3.11 For precise assumptions on the coefficients, we refer to Section [3l 
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2. Description of the numerical scheme 





We recall the idea of the Averaging Principle - proved in the previous article [3]: when e goes to 
0, can be approximated by X defined by the averaged equation 

(2.2) ^=^XW+T(XW) 

X(t = 0) = X G H- 

the error is controlled in a strong sense by Ct^/^-" and in a weak sense by Ce^ where r > can 
be chosen as small as necessary, and where C is a constant. 
The averaged coefficient F - see (15. 4p - satisfies 



F{X)= [ F{X,y)fi''{dy)= lim nF{X,Yx{s,y))], 

where fj,-^ is the unique invariant probability measure of the fast process Yx with frozen slow 
component - more details are given in Section [5.11 

To apply the HMM strategy, we need to define a macrosolver and a microsolver. We denote by 
At the macrotime step size, and by 6t the microtime step size. Let also T > be a given final time. 

The construction of the macrosolver is deeply based on the averaging principle: for nAt < T 
X'^(nAt) can be approximated by X(nAt). If the averaged coefficient F was known, one could build 
an approximation with a deterministic numerical scheme on the averaged equation; nevertheless in 
general it is not the case, and the idea is to calculate an approximation of this coefficient on-the-fiy, 
by using the microsolver. 

Therefore the macrosolver is defined in the following way: for any < n < [^J := no, 

Xn+l = Xn + /S.tAXn+1 + AtFn, 

with the initial condition Xq = x. Fn has to be defined; before that, we notice that the above 
definition leads to a semi-implicit Euler scheme - we use implicitness on the linear part, but the 
nonlinear part is explicit. If we define a bounded linear operator on H by S^t = {I ~ A.tA)~^, we 
rather use the following explicit formula 

(2.3) Xn+l = SAtXn + AtSAtFn- 

We want Fn to be an approximation of F{Xn). The role of the microsolver is to give an ap- 
proximation of Yx„ - the fast process with frozen slow component Xn, when n is fixed; moreover 
we compute a finite number M of independent replicas of the process, in order to approximate 
theoretical expectations by discrete averages over different realizations of the random variables, in 
a Monte-Carlo spirit. Therefore the microsolver is defined in the following way: we fix a realization 
index j S {!,..., M}, and a macrotime step n; then for any m > 

'^n,m+l,j — ^n,m,j H ~BYn^rn+l,j H ~G(^XmYn^rn,j 

As above we can give an explicit formula 

(2.4) Yn^m+l,j = RstY-n^mJ H R§lG{Xn,Yn^rn,j) + \ — RstCn,m+l,j, 

where for any r > Rr = (/ — tB)^^. 
The noises C,n,m,j defined by 




/5t 

where (T^('^J)) 

i<i<A/,o<n<no a^re independent cylindrical Wiener processes on H. It is essential to 
use independent noises at each macrotime step. It is important to remark that this equation is 

3 



well-posed in the Hilbert space H - see the conditions required when a cylindrical Wiener process is 
used in Section [3. II - since Rr is a Hilbert-Schmidt operator from H to H, under assumptions given 
in Section [3j 

The missing definition can now be written: is given by 

M UT+N-l 

(2.5) ^- = MivE E nXn,Y^,m,j). 

riT is the number of microtime steps that are not used in the evaluation of the average in while 
N is the number of microtime steps that are then used for this evaluation. Each macrotime step 
then requires the computation of mo = ?^T + — 1 values of the microsolver. 

At each macrotime step Yn^m,j must be initialized at time m = 0. In our proofs, this is not as 
important as in [12j, but for definiteness we use the same method: we initialize with the last value 
computed during the previous macrotime step: l^_|_i^oj = ^,mo,i) while i^,o,j = V- 

The aim of the analysis for HMM schemes is to prove that under an appropriate choice of the 
parameters nT,N,M of the scheme, we can bound the error by expressions of the following kind, 
where r > is chosen as small as necessary: for n = no = [^J , we have the strong error estimate 

E\X'{nAt) -Xn\<C ^ ^^l-r ^ 

and if $ is a test function of class C^, bounded and with bounded derivatives, we have the weak 
error estimate 

|E$(X^(nAt)) - E$(X„)| < C ^e^"'' + At^-'' + 

The origin of the three error terms appears clearly in the proofs - see Sections [6] and [71 the first 
one is the averaging error, the second one is the error in a deterministic scheme with the macrotime 
step, and the third one is the weak error in a scheme for stochastic equations with the microtime 
step. We recall that in the SPDE case the strong order of the semi-implicit Euler scheme for the 
microsolver used here is 1/4, while the weak order is 1/2, while in the SDE situation the respective 
orders are 1/2 and 1. The macrosolver is deterministic, so that the order is 1. Precise results for any 
choice of nT,N,M are given in Theorems 14.11 and 14.21 below, while the choice of these parameters 
is explained in Section [4.21 

3. Assumptions 

As mentioned above, System (j2.ip satisfies an averaging principle, and strong and weak order 
of convergence can be given. The HMM method relies on that idea. The natural assumptions 
are basically the same as the hypothesis needed to prove those results, but must be strenghtened 
sometimes. The typical kind of coefficients is specified in Examples 13.21 and 13.101 

3.1. The cylindrical Wiener process and stochastic integration in H. Here, we recall the 
definition of the cylindrical Wiener process and of stochastic integral on a separable Hilbert space 
H - its norm is denoted by \.\h or just |.|. For more details, see [6]. 

We first fix a filtered probability space (O, J^, (J^()t>o,P). A cylindrical Wiener process on H is 
defined with two elements: 

• a complete orthonormal system of H, denoted by {qi)i^j, where / is a subset of N; 

• a family {/3i)i^j of independent real Wiener processes with respect to the filtration {{J^t)t>Q)' 

4 





(3.1) w{t) = Y,m(ii- 

i&I 

When I is a finite set, we recover the usual definition of Wiener processes in the finite dimensional 
space RI^L However the subject here is the study of some Stochastic Partial Differential Equations, 
so that in the sequel the underlying Hilbert space H is infinite dimensional; for instance when 
H = L^(0, 1), an example of complete orthonormal system is (q^) = (sin(A;.))fc>i - see Example 13.21 

A fundamental remark is that the series in (I3.ip does not converge in H; but if a linear operator 
^ : H ^ K is Hilbert-Schmidt, then ^W{t) converges in L^(Q, H) for any i > 0. 

We recall that a linear operator ^ : H ^ K is said to be Hilbert-Schmidt when 

+ 00 

where the definition is independent of the choice of the orthonormal basis (g^) of H. The space of 
Hilbert-Schmidt operators from H to K is denoted C2{H,K); endowed with the norm \-\c2(H,K) it 
is an Hilbert space. 

The stochastic integral Jg '^{s)dW{s) is defined in K for predictible processes ^' with values in 
C2{H,K) such that x)^'^ ^ ^-^J moreover when \I' G L'^{Q x [0,t]; C2{H, K)), the 

following two properties hold: 

^{s)dW{s)\\=¥. \^{s)\l^^H,K)ds, (Ito isometry), 

E / ^{s)dW{s) = 0. 
Jo 

A generalization of Ito formula also holds - see |6] . 
For instance, ii v = '^^k&i v^Qk ^ we can define 

< w{t),v>= r < v,dW{s) >=Y,Pk{t)vk; 
•^^ fceN 

we then have the following space-time white noise property 

E < W{t),Vl X W{s),V2 >= t A S < Vl,V2 > . 

Therefore to be able to integrate a process with respect to W requires some strong properties 
on the integrand; in our SPDE setting, the Hilbert-Schmidt properties follow from the assumptions 
made on the linear coefficients of the equations. 

3.2. Assumptions on the linear operators. First, we have to specify some properties of the 
linear operators A and B coming into the definition of system (|2.ip ; we assume that the linear parts 
are of parabolic type, with space variable ^ G (0, 1). 

We assume that A and B are unbounded linear operators, with domains D(A) and D(B), which 
satisfy the following assumptions: 

Assumptions 3.1. (1) There exist complete orthonormal systems of H, {ek)ke'M (^nd {fk)keN, 
and {Xk)ke'N o,nd {p,k)ke'M non- decreasing sequences of real positive numbers such that: 

Ack = —^k^k for all k 

Bfk = -f^kfk for all G N. 

We use the notations A := Aq > and /i := /xq > for the smallest eigenvalues of A and B. 
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(2) For every k £ N, fk is Lipschitz continuous and bounded on [0,1], with a uniform control 
with respect to k: there exists C > such that for any (,i,£,2 G [0, 1] 

< C and Ifki^i) - fk{^2)\ < C^\^, - 6|. 

(3) The sequences (A^) and (/i^) go to +oo; moreover we have some control of the behaviour of 
(fik) given by: 

+ 00 ^ 

Example 3.2. A = B = with domain H'^{0, 1) n Hq{0, 1) C L^(0, 1) - homogeneous Dirichlet 
boundary conditions: in that case Afc = /x^ = vr^A;^, and efc(^) = fk{C) = \/2 sin(/i:7r^) - see [Ij. 

In the abstract setting, powers of —A and —B, with their domains can be easily defined: 

Definition 3.3. For a,b G [0,1], we define the operators {—AY and {—B)^ by 

oo 

{-Arx = ^Xtxkek€H, 

oo 

{-B)'y = Y,4ykfkeH, 



k=0 



with domains 

Di-A)" = \ x = Y^ XkCk G H; \x\f_^^a := ^(Afc)^'']^^]^ < +cx) 



+ 00 +00 



A:=0 A:=0 

+ 00 +00 



D{-B)' = }y = Y^y,f,eH, \y\l_^^, := Y^il^kfV <+oo\. 

{ k=0 k=0 ) 

On D{—A)"', the norm ] . and the Sobolev norm of H^°' are equivalent: when x belongs to 

a space D{—A)°'^ the exponent a represents some regularity of the function x. 
When a > 0, we can also define a bounded linear operator (— ^)~" in H with 

+ 00 

k=Q 

where x = Yl,k=o ^kS-k G H. 

Under the previous assumptions on the linear coefficients, it is easy to show that the following 
stochastic integral is well-defined in H, for any t > 0: 

(3.2) W^{t)= f e^'-'^^dW{s). 

Jo 

It is called a stochastic convolution, and it is the unique mild solution of 

dZ{t) = BZ{t)dt + dW{t), Z(0) = 0. 

Under the second condition of Assumption 13. 1^ there exists (5 > such that for any t > we 
have Jq ^l^^'^l^jC-ff)^'^ ^ +oo; it can then be proved that has continuous trajectories - via the 
factorization method^ see [6] - and that for any 1 < p < +oo, any < 7 < 1/4, there exists a 
constant C{p,'y) < +00 such that for any t > 

(3.3) E\W''{t)\l^^,<C{pn)- 
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3.3. Assumptions on the nonlinear coefficients. We now give the Assumptions on the nonhn- 
ear coefficients F, G . H x H ^ H . First, we need some regularity properties: 

Assumptions 3.4. We assume that there exists < rj < ^ and a constant C such that the following 
directional derivatives are well-defined and controlled: 

• For any x,y G H and h£ H, \DxF{x,y).h\ < C\h\}{ and \DyF(x,y).h\ < C\k\H- 

• For any x,y e H, h € H, k e Di-A)"^, \Dl^F{x,y).{h,k)\ < C\h\H\k\(^_A)v ■ 

• For anyx,y£H,heH,ke Di-B)^, \DlyF{x,y).{h,k)\ < C\h\H\k\^-B)v ■ 

• For anyx,y£H,heH,ke D{-Bf, \DlyF{x,y).{h,k)\ < C\h\H\k\^-B)v ■ 

• For any x,y e H, hG Di-A)"^, ke H, \DlyF{x,y).{h,k)\ < C\h\,^_A)v\k\H ■ 
We moreover assume that F is bounded. 

We also need: 

Assumptions 3.5. For r] defined in the previous Assumption we have for any x,y G H and 

h,k£ H 

\{-Ar^Dl,F{x,y).{h,k)\ < C\h\H\k\H 

\{-A)-^DlyF{x,y).{h,k)\ < C\h\H\k\H 

\{-A)-^DlyF{x,y).{h,k)\ < C\h\H\k\H. 

We assume that the fast equation is a gradient system: for any x the nonlinear coefficient G{x, .) 
is the derivative of some potential U. We also assume regularity assumptions as for F. 

Assumptions 3.6. The function G is defined through G{x,y) = VyU{x,y), for some potential 
U : H X H ^ M. Moreover we assume that G is bounded, and that the regularity assumptions given 
in the Assumption\3.4\ are also satisfied for G. 



For G, we need a stronger hypothesis than for F - in order to get Proposition 15.51 . Assumption 
33] becomes: 



Assumptions 3.7. We have for any x,y £ H , h,k £ H , z £ L°°(0, 1) 

I < Dx^G{x,y).{h,k),z >h \ < G\h\H\k\H\z\L^{o,i)- 

Finally, we need to assume some dissipativity of the fast equation. Assumption 13.81 is necessary 
to obtain strong convergence in the Averaging Principle, while Assumption 13.91 is weaker and can 
lead to the weak convergence. 

Assumptions 3.8 (Strict dissipativity). Let Lg denote the Lipschitz constant of G with respect to 
its second variable; then 

(SD) Lg < /i, 

where fj, is defined in Assumption \3. 1[ 

Assumptions 3.9 (Weak Dissipativity). There exist c > and G > such that for any y G L){B) 

(WD) <By + G{y),y><-c\yf + G. 

The second Assumption is satisfied as soon as G is bounded, while the first one requires some 
knowledge of the Lipschitz constant of G. 

Example 3.10. We give some fundamental examples of nonlinearities for which the previous as- 
sumptions are satisfied: 

• Functions F,G : H x H ^ H of class C^, bounded and with bounded derivatives, such that 
G{x,y) = 'VyU{x,y) and satisfying (ISPp fit in the framework, with the choice rj = 0. 
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Functions F and G can be Nemytskii operators: let / : (0, 1) x — J- M 6e a measurable 
function such that for almost every ^ G (0, 1) /(^, .) is twice continuously differentiahle, 
bounded and with uniformly bounded derivatives. Then F is defined for every x,y € H = 
L2(0,1) by 

F{x,ym = f{^,x{C),y{C)). 
For G, we assume that there exists a function g with the same properties as f above, such 
that G(x,y)(^) = g{^,x{^),y{£^)). The strict dissipativity Assumption \SF\ is then satisfied 
when 

sup \ — {^,,x,y)\ < fi. 

Ce(a,b),a:GR,j/eR CfV 



The conditions in Assumption \ 3.4\ are then satisfied for F and G as soon as there exists 
7] < 1/2 such that D{—A)'^ and D{—B)'^ are continuously embedded into L°°{0, 1) - it is the 
case for A and B given in Example \3.S\ with r] > 1/4. Assumptions \3.5\ and \3.7\ are also 
satisfied. 



We then deduce that the system (12. ip is weh-posed for any e > 0, on any finite time interval 
[0,T]. Under Assumptions [3711 13.4| 13. 6[ the nonhnearities -F and G are Lipschitz continuous, and 
the fohowing Proposition is classical - see [6]: 

Proposition 3.11. For every e > 0, T > 0, x & H , y £ H , system (|2.ip admits a unique mild 
solution {X',Y') G {L^{n,C{[0,T], H))^ : for any t e [0,T], 

X'{t) = e^^x+ j e^*-''>^F{X'{s),Y'{s))ds 

Y%t) = e-^^y + - e^^G{X'{s),Y%s))ds + ^ e^^dVF(s). 
^ Jo Jo 

4. Convergence results 

4.1. Statement of the Theorems. We can now state the main results: the numerical process 
(Xn) defined by (|2.3p approximates the slow component X^(nAt) of system (|2.ip . with strong and 
weak error estimates given in the Theorems. 

If the Strict Dissipativity Assumption 13.81 is satisfied, we can prove the following: 

Theorem 4.1 (Strong convergence). Assuming (jSPp . for any < r < 1/2, < k < 1/2, T > 0, 
eo > 0, Aq > 0, To > 0, there exists C > such that for any < e < eo, < At < Aq, 6t > such 
that T = Y ^ tq C'lT'd 1 < n < uq = [-^J 



E\X'{nAt) - Xn\ < c(^e^/2-'- + i + At^"''^ 




+ C 

'm{n^ + 1 

Under the more general Weak Dissipativity Assumption 13. 9| we can prove the following: 

Theorem 4.2 (Weak convergence). Assume that X £ D{{-Ay) and y € D{{-BY), with 6,13 £ 
]0, 1]. Let ^ : H ^ H bounded, of class C^, with bounded first and second order derivatives. Then 
with the weak dissipativity assumption (IWDp . for any < r < 1, k < 1/2, T > 0, eo > 0, Aq > 0, 



To > 0, there exists C > such that for any < e < eo, < At < Aq, 5t > such that t = y ^ tq 
and 1 < n < riQ = [^J 

\E^{X'{nAt)) - E$(X„)| < Cj^e^^"^ + ^ + At^"^^ 

Moreover, if we assume (jSPp in Theorem 14.21 the factor (1 + — ^•^Sty/2~K ) replaced by 1. 

If we look at the estimates of Theorems 14.11 and 14.21 at time n = uq, the factor is of size 
At = 0(Ati-^). 

4.2. Some comments on the convergence results. The proofs rely on the following decompo- 
sition, which explains the origin of the different error terms: 

(4.1) X'(nAt) -Xn = X'{nAt) - X{nAt) + X(nAt) - X„ + X„ - 

The numerical process is defined in (j5.12p below: it is the solution of the macrosolver with a 

known F, while (Xn) is solution of the macrosolver using The continuous processes X and X^ 
are defined at the beginning of Section [2j 

The first term is bounded thanks to the averaging result, using strong and weak order of conver- 
gence results - see the article [3]. The second term is the error in a deterministic numerical scheme, 
for which convergence results are classical; we recall the estimate in Proposition 15.131 The third 
term is the difference between the two numerical approximations, and the main task is the control 
of this part: we show that an extension of the averaging effect holds at the discrete time level, where 
Xn plays the role of an averaged process for X„. 

When we look at the Theorems 14.11 and 14.21 we first remark that we obtain the same kind of 
bounds as in the finite dimensional case of [12j. However we notice some differences; they are due 
both to the infinite dimensional setting and to different proofs. 

First, the weak order of the Euler scheme for SPDEs is only 1/2 - see [7J - while it is 1 for SDKs; 
in the estimate, the strong order never appears, and this is one of the main theoretical advantages 
of the method. For completeness, we recall that the strong order is 1/4 in the SPDE case - see |14| 

- and 1/2 for SDEs. In fact, at least in the strictly dissipative case, we are comparing the invariant 
measures of the continuous fast equation with the invariant measure of its numerical approximation 

- see Theorem 15.111 and Corollary 15.121 In the weakly dissipative case, we use the weak error 
estimates where the constants do not depend on the final time. 

We need some regularity on the initial condition x in Theorem 14.21 since we apply the weak order 
averaging Theorem of [3|; nonetheless the parameter 6 > does not appear in the different orders 
of convergence, but only in the constant C. 

In Theorem 14. 2| we do not exactly obtain the same estimate as in |12) . since M does not appear: 
as a consequence, the estimate is not improved when M is increasing. Since we are looking at some 
weak error estimate, this can seem natural; first M appeared in the estimate of [ISj only because 
of some strong error estimate used in the proof, while here to prove Theorem 14.21 we always think 
with a weak error state of mind. Second, the corresponding term in the weak estimate of |12| is 
easily controlled by At, which already appears in another part of the error since we use a first order 
scheme for the macrosolver - and not a general scheme - so that there is no need for getting a better 
estimate. 

The proofs of Theorems 14. 1 1 and 14.21 are inspired by the ones in |12J, but are different. The strong 
error is analysed in a global way, as in the proof of the strong order Theorem in [3] . Moreover we do 

9 



not need a counterpart of Lemma 2.6 in [12], and we thus think that our method is more naturah 
For the control of the weak error, we introduce a new appropriate auxihary function. 

As explained in the introduction, we present here some simplified results and proofs. In the stricly 
dissipative case, we could go further in the analysis of the error, thanks to the initialization procedure 
of the scheme, and to additional exponential convergence results. But after a very technical work, 
due to the regularity assumptions made on the nonlinear coefficients F and G we would only obtain 
a factor At^"*""*^ instead of At in the finite dimensional case - where is any r > 0, which can be 
chosen as small as necessary, and i] is defined in Assumption 13.41 and depends on the regularity 
properties of the nonlinearities F and G. Nevertheless, the cost of the scheme remains of the same 
order even if we do not use those better estimates. 

We consider that e is fixed, and then the numerical method depends on several parameters 
N, M, riT, dt, At; we would like to give some explicit choices of the parameters for which we have a 
simple bound, and for which a direct scheme, where the stiffness of the system (12. ip is not treated, 
is less efficient, for the cost defined by 

MrriQ 



cost 



At 
= 1. 



This is the total number of microtime steps for T 

We take n = no'- then ^ < CAt. 

The time scale separation parameter e is supposed to be very small, while we fix some tolerance 
tol for the error in the numerical scheme: more precisely, we want the error to be 

E\X^{nAt) - Xn\< C(e^/2-'' + tol) (strong error) 
|E$(X'(nAt)) - E$(X„)| < G{e^-'' + tol) (weak error). 



where r satisfies either r < - strong error - or r < - weak error 
are defined by the conditions e^/^^'"^ = o{tol) (strong error) and 



if the parameters and 
g errorj and e" = o{tol) (weak error): 
the numerical error is dominant with respect to the averaging error. We want to show that we can 
choose the parameters of the scheme such that each term of the estimate of the Theorem 14.11 are of 
size tol, except the first one, and such that the cost of the scheme is lower than the cost of a direct 
scheme. 

• We first notice that the choice for At and — is easy: 



(4.3) 



At 

5t 



tol^-^ 



tol^/2 



where r < or r < r^, and k < 1/2 are fixed. The choice of other parameters N,M,nT 
changes when we look at the strong or at the weak error. 

We first focus on the strong convergence case. We consider the case when either M = 1 or 
N = 1. We obtain 



M = 1 



N = 1 



parameters 



N 



tol-'^^- 



1/2- 



UT ^ tol i/2-« log(toZ-^) 



M ^ tol^- 



--2 



\og{tol-^)tol 1/2- 



cost 



tol ^ i/a--. 



tor 



1/2- 



log (toZ 



Unlike in the finite dimensional situation, the bound depends on a factor At^~^~^ - 
where r] is linked to the regularity of the nonlinear coefficients F and G. The charge of 
this difference could be taken by either or n^; the advantage of choosing ut is that the 
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difference only involves a logarithmic factor on the final cost. As a consequence, we can 
make no difference between the choices = 1 and M = 1. 

We can now make a comparison with the cost coming from the use of a direct scheme: 
since the strong order of Euler scheme for SPDEs is 1/4 — k, the error can be bounded by 
C{Yy^^~^ for some constant C. Therefore to have a bound of size tol, the time step size 

6t much satisfy - ~ tol iT*^ , This leads to a cost 

6t 

we conclude that in this situation the HMM numerical method is better, since the ratio of 
the cost tends to when e — t- - under the condition r < r^. 

We now focus on the weak error estimate. Here M plays no role, so that M = 1 is the good 
choice! The time steps At and 5t are still given by (I4.3p . It remains to look for parameters 
N and nx such that 

^ ^ At. 



+ 1 

Once again, we need to choose Ht and such that A^^ or is large. Since exponential 
decrease is faster than polynomial decrease, the best choice is e"'^"^" ~ At, and therefore 
riT ^ toF'T^ log(to/"^), while A^ = 1. 

We then see that — — s° ^^^^^ additional factor appearing when only 
weak dissipativity is satisfied plays no role. 

_ 1 1 

The corresponding cost of the scheme is then of order ^ ~ log(to/ )tol i^'' 1/2-K . 

We can again compare this cost with the cost coming from the use of a direct scheme: the 
weak order of Euler scheme for SPDEs is 1/2 — k; to obtain the second estimate of (|4.2p . 
we need a cost 

6t 

and again the HMM scheme is better than the direct scheme for such a range of parameters. 

5. Preliminary results 

5.1. Known results about the fast equation and the averaged equation. In this section, 
we just recall without proof the main results on the fast equation with frozen slow component and 
on the averaged equation, defined below. Proofs can be found in |4j for the strict dissipative case, 
and the extension to the weakly dissipative situation relies on arguments explained below. 

li X € H, we define an equation on the fast variable where the slow variable is fixed and equal to 

x: 

dY,it,y) = iBY,{t,y) + Gix,Y,it,y)))dt + dWit), 

YA0,y) = y. 

This equation admits a unique mild solution, defined on [0, +oo[. 

Since is involved at time t > 0, heuristically we need to analyse the properties of Yx{-,y), 
with e — )• 0, and by a change of time we need to understand the asymptotic behaviour of 
when time goes to infinity. 

Under the strict dissipativity Assumption 13. 8| we obtain a contractivity of trajectories issued 
from different initial conditions and driven by the same noise: 

Proposition 5.1. With (jSPp . for any t>0, x,yi,y2 € H we have 

\Yx{t,yi) -Yx{t,y2)\H < e~^^^^*|?/i - y2|/f- 
11 



Under the weak dissipativity Assumption 13.91 we obtain such an exponential convergence result 
for the laws instead of trajectories. The proof of this result is not staightforward, and can be found 
in [8j - see also [2j for further references. 

Proposition 5.2. With (IWD|1 . there exist c > 0, C > such that for any bounded test function (j), 
any t>Q and any yi,y2 & H 

(5.2) |E0(y(t,yi)) - E0(y(t,y2))| < CM^{1 + \yi\^ + |y2|')e-^*. 

As a consequence, we can show that there exists a unique invariant probability measure associated 
with Yx, and that the convergence to equilibrium is exponentially fast. 

First, let u = AA(0, V2) be the centered G aussian probability measure on H with the 

covariance operator (—B)~^/2 - which is positive and trace-class, thanks to Assumption 13.11 

Then defined by 

(5.3) fi^idy) = ^e^^i-^y^idy), 

where Z{x) g]0, +00 [ is a normalization constant, is the unique probability invariant measure associ- 
ated to ■ This expression comes from the gradient structure of equation (j5.ip , given in Assumption 



Second, under any of the dissipativity assumptions, the convergence to equilibrium is exponential 
in the following sense: 

Proposition 5.3. If we assume (jSDh or (jWPp . there exist constants C, c > such that for any 
bounded function (j): H^Mor(j): H^H,t>0 and x,y £ H we have 



E<PiYxit,y))- / <l){z)f,^{dz)\<Cm^il + \y\l)e- 



'H 

Now we define the averaged equation. First we define the averaged nonlinear coefficient F: 
Definition 5.4. For any x £ H, 

(5.4) F{x)= I F{x,y)^f{dy). 

Jh 

Using Assumptions 13. 4| 13.61 and the expression of ^u^ , it is easy to prove that F is bounded and 
Lipschitz continuous. 

Under Assumptions [331 13.61 we can easily prove the following properties on F: 

Proposition 5.5. There exists < r] < 1 and a constant C such that the following directional 
derivatives of F are well-defined and controlled: 

• For anyxeH,he H, \DF{x).h\ < C\h\H. 

• For anyxGH,hGH,kG D{-A)'', \D^F{x).{h, k)\ < C\h\H\k\^_A)v ■ 

• For any x £ H, h,k £ H, \{-AyD^F{x).{h,k)\ < C\h\H\k\H- 
Moreover, F is bounded and Lipschitz continuous. 

The last estimate above is a consequence of Assumptions 1331 and [3771 and of the following fact: 
we have almost surely W^{t) G L°°(0, 1) for any t > 0, and 



(5.5) / \z\LoorQi\u{dz) < +CO. 

Jh 

Moreover if r/ > 1/4 - which is the right condition in the case of linear Laplace operators and 
of nonlinear Nemytskii operators - we have \z\(^_j^-^r,i'{dz) = +00: we thus need the restrictive 
condition in Assumption 13.71 
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Remark 5.6. Even when F and G are Nemytskii operators, F is not such an operator in general. 
Then the averaged equation (see (j2.2p in the introduction) can be defined: 

^ = AX{t) + F(X{t)), 

with initial condition X{0) = x £ H. For any T > 0, this deterministic equation admits a unique 
mild solution Y e C{[0,T], H). 

5.2. Estimates on the numerical solutions. We CcLii give uniforni estiniajtes on ctiid ^^^mjj 
defined by ESI and Ea 



Lemma 5.7. There exists C > such that we have ¥-almost surely 

\Xn\ <C(1 + |X|), 

for any <n <nQ. 

Proof The linear operator S^t satifies \SAt\c{H) ^ i+AAt ' moreover F is bounded, so that by (12. Sp 
we almost surely have for any n > \Fn\ < ||F||oo- The end of the proof is then straightforward; 
we also notice that C does not depend on the final time T. □ 

Lemma 5.8. There exists C > - which does not depend on T > 0, on N, on ut or on M - such 
that for any At>0, r = Y>0, 0<n< ng, < m < tuq and 1 < j < M , we have 

Proof We introduce ujn.mj defined by the fast numerical scheme with no nonlinear coefficient - 
see (j2.4p - with the notation t = y' for any < j < M, < n < uq and < m < niQ 

(5.6) 

with the initial condition Wn,o,j = 0. Wn,m,j is the numerical approximation - using the microsolver 
scheme with a step size r - of the process defined by 

dZ("j)(t) = SZ("'^')(t)dt + dVF("'^')(t) 

Z("'J')(0) = 0, 

where wj;"'''^^ = -^W^^'''\ Notice that the (W^"''^^) are independent cylindrical Wiener processes, 
and that 

(,n, 771+1,1 — 7= • 

The Z("'^) are independent realizations of Z{t) = W^{t) = e^''-''^^dW{s). Using Theorem 3.2 
of [14], giving the strong order 1/4 for the microsolver - when the initial condition is 0, with no 
nonlinear coefficient, with a constant diffusion term and under the assumptions made here - we 
get the following estimate: for any < k < 1/2, there exists > 0, such that for any r > 0, 
< n < no, 1 < J < M and < m < thq 

(5.7) n^n,m,j - Z("'^')(mr)|2 < C^t^'^-\ 

Thanks to (13. 3p and (15. 7p . for any tq > 0, there exists a constant C{tq) such that for any < r < tq, 
< n < no, 1 < i < M and < m < niQ we have 
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Now for any < m < mo we define Dn,m,j = Yn. ,m,j '^n,m,j] it IS enough to control |-D7i,m,j| ) 



P-almost surely. By (j2.4p . we have the following expression: for any < m < niQ 
Since G is bounded, and using the inequality \Rt\c{H) ^ TTJI^' 

I + fJ.T 

Therefore we have for any < m < mg 

(1 + flTy'^\Dn,m,j\ < \Dn,Oj\ + C[(l + ^t)™ - 1]. 
But DnfiJ = Dn-l,mo,j- So we get 

(l + ^rr°|Z)„+i,oj| < |Z)„,oj| + C[(l + ^Tro-l]. 

Therefore 

(1 + ^r)"-o|Z„,oj| < \Do,oj\ + C{1 + firr"'^ = \y\ + C(l + /ir)""^". 
As a consequence, we get for any < n < tiq and any < m < niQ, \Dnfij\ < C(l + \y\), and then 

\Zn,m,j\<2C{l + \y\). " □ 

5.3. Asymptotic behaviour of the "fast" numerical scheme. At the continuous time level, 
the averaging principle proved in [3j comes from the asymptotic behaviour of the fast equation with 
frozen slow component (|5.ip . as described in Section [5. 11 The underlying idea of the HMM method 
in our setting is to prove a similar averaging effect at the discrete time level: we therefore study 
the asymptotic behaviour of the fast numerical scheme which defines the microsolver, with frozen 
slow component - in other words we are looking at the evolution of the microsolver during one fixed 
macrotime step. 

In Section [5.11 we have seen that under the weak dissipativity Assumption 13.91 the fast equation 
with frozen slow component admits a unique invariant probability measure At the discrete time 
level, this Assumption only yields the existence of invariant laws; to get a unique invariant law /x^''^, 
we need strict dissipativity ISD| and we obtain: 



Theorem 5.9. Under Assumption \3.8l for any r > and any x G H; the numerical scheme 
5.9\ admits a unique ergodic invariant probability measure /i^'"^. Moreover, we have convergence to 
equilibrium in the following sense: for any tq > 0, there exist c > and C > such that for any 
< T < tq, X € H, y £ H, any Lipschitz continuous function (j) from H to R, and m > 0, we have 



(5.8) miY^iy))) - / H^h'''^idz)\ < Oil + \ymupe 



IH 

St 



We recall the notation t = — for the effective time step; the noise is defined with a cylindrical 
Wiener process W: Cm+i = ^('"+'^ ^"^^ . If we fix the slow component x G H, we define 

(5.9) YZ+i{y) = RrY:,{y)+TRrG{x,YZiy)) + V^RrU+i, 



with the initial condition Y^^y) = y. 
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5.3.1. Existence of an invariant law. With Equation (|5.9p . we associate the transition semi-group 
(Pm^): if is a bounded measurable function from H to M, if y £ H and m > 

(5.10) P;^'Xy)=E[<^(y^(y))]. 

We also denote by the law of Ym^{y): then 

We notice that the semi-group [Pm) satisfies the Feller property: if </> is bounded and continuous, 
then Pm(t> £ is bounded and continuous. 

The required tightness property is a consequence of the following estimate, which can be proved 
thanks to regularization properties of the semi-group {R^)m'- for any < 7 < 1/4, r > 0, there 
exists C(7, r) > such that for any m > 1 and r < tq 

<C(7,r). 

Moreover if < 7 < 1/4, the embedding of D{—B)'^ in H is compact. 



5.3.2. Uniqueness under strict dissipativity. The key estimate to prove uniqueness is the following 
contractivity property, which holds thanks to Assumption 13.81 



Proposition 5.10. For any tq > 0, there exists c > such that for any < r < tq, m > 0, 

yi,y2 ^ H , X £ H we have ¥-almost surely 

(5.11) \YZ{yi)-YZ{y2)\<e-'^^^\y^-y,\. 
Proof If we define = Ymiyi) — ^(2/2), then we have the equation 

rm+i = rm + rBRm+i + t{G{x, Ym{yi)) - G{x, Ym{y2))), 

ro = yi- y2- 

If we take the scalar product in H of this equation with r^+i, we get 

|rm+ip- < rm,rm+i > = r < Brm+i,rm+i > 

+ T < G{x, Ymiyi)) - G{x, Ym{y2)),rm+i >; 

The left hand-side is equal to i(|rm+iP — kmP) + ^km+i — "''mpj and we get 



^(|rm+ip - |r„p) < -r|(-5)^/2r„+ip + TLg\rjn\\rm+i\ 
< -fJ'T\rm+if + ^TLg{\rm+if + \r. '^^ 



Therefore we have (1 -|- r(2// — Lg))|rm+iP < (1 + TLg)\rm\'^ ■ We remark that ISDl implies that for 
any tq > 0, there exists c > such that if r < tq we have p = i_|_^(2/j-°L ) — ^"^^^i therefore 

\rm\'<pnyi-y2\'<e-^'"'^\yi-y2\'. 

□ 

As a consequence, there exists a unique ergodic invariant probability measure /i^'"^, which is 
strongly mixing. Moreover one can prove that there exists C > such that for any r > and any 
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£ H \y\fi^''^{dy) < C; we therefore get 



JH JH 



< / E|0(y^(y))-<^(y^(z)|^---(dz) 
JH 

< [0]Lip / e-'^^\y - z\^,-'^dz) 

JH 



< C[<^]Lipe- 



cniT 



5.3.3. Approximation of the invariant law /u^ by the fast numerical scheme. We recall that /i^ de- 
notes the invariant law of the continuous time fast equation with frozen slow component (|5.ip . 
Thanks to the fast numerical scheme, we can get an approximation result, which is proved in [2]: 
with test functions of class C^, we can control the weak error for any time with a convergence of 
order 1/2 with respect to the time step r. Moreover the estimate is easily seen to be independent 
from the slow component x. 
We define for any ^ of class 

11^11(2) = sup |$(y)| + sup \Dy^{y).h\+ sup \Dl^{y).{h,k)\. 

yen yeH,heH,\h\=l yeH,h,keH,\h\=\k\=l 

Theorem 5.11. With the dissipativity condition (jWPp . for any < k < 1/2, for any tq > 0, there 
exists C, c > such that for any $ of class C^, for any x,y £ H, for any < r < tq and any integer 
2 < m < +00 

\EMY,{mT,y))] - E[$(y^(y))]| < C||<1>||(2)(1 + |yP)(((m - 1)t)-V2+« + 1^/^-. 

As explained in Section [5?3l in general the fast numerical scheme does not admit a unique invariant 
probability measure, but when the strict dissipativity Assumption 13 . 81 is satisfied, it admits a unique 
invariant law /i^'"^. 

Corollary 5.12. Under the assumptions of Theorem \5.11\ 
(i) if only (jWPp is satisfied, we have for any m >2 

I / <^dn' - EMYZ)]\ < C||$||(2)(l + |y|')(((m - 1)t)-i/2+k ^ i)^i/2-K + C77V($)(i + \y\-')e-'^\ 

JH 

(a) if moreover (jSPp is satisfied, 

I / «>d^^- / «>d^^'n < C||$||(2)ri/2-^ 
Jh JH 

We recall that in the case of Euler scheme for SDEs this kind of results holds with the order of 
convergence 1. 

5.4. Error in the deterministic scheme (|5.12p . We define a scheme based on the macrosolver, 
for theoretical purpose, in the situation when F is known: 

(5.12) _ 

Ao = X. 

We can look at the error between X„, defined by (j5.12p . and X(nAt), defined by (|2.2p . Here 
quantities are deterministic, and the following result is classical - see |13| . [S], or the details of the 
proofs in [14j: 
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Proposition 5.13. For any < r < 1, Ato > and T > 0, there exists C > 0, such that for any 

c 



0<At< Ato and 1 < n < [-^ 



\Xn - X{nAt)\ < - + C(l + \x\)At 



l-r 



n 



6. Proof of the strong convergence Theorem 14.11 

Thanks to the fohowing remark on the construction of the scheme, we can consider that ^,m,j 
corresponds to a process evaluated at time m + nmo- The idea is to build noise processes C^^^ by 
concatenation of the (^("''■?) for the different n. 

Remark 6.1. To compute Yn^mj with the microsolver, the total number of microtime steps used is 
nrriQ + m. It is then natural to use only one noise process (!^^^\ for each 1 < j < M , and to make 
an evaluation involving time nrriQ + m. More precisely, we can use 

Aj) _ ik+l)St ^^kSt 



/6t 

with i<j<M being M independent cylindrical Wiener processes on H. 

Then one can define for < t < m^dt and for < m < m-o — 1 

= W^^\t + nmoSt) - W^^\nmQ5t) 

r — 

<,n,m+l,j — Snmo+m+1- 

We also introduce the following notation: E„ denotes conditional expectation with respect to the 
cr-field 

(6.1) Gn = (T{Ck,m,j, < k < n - 1,1 < m < mo,l < j < M) = a(C^), 1 < m < nmo). 

We notice that Xn is ^„-measurable, but that Fn is not. 

The final time T is fixed and we recall the notation uq = [-^J . 

To simplify notations, we do not always precise the range of summation in the expressions below: 
the indices j,ji,j2 belong to {1, . . . , M}, and m, mi,m2 belong to {ut, . . . , riy + — 1 = mo}- 
We recall that according to the decomposition of the error (|4.ip . we have to control 

E\X^{nAt) - Xn\ < E\X^{nAt) - X{nAt)\ 

(6.2) +|X(nAt)-X„| 

+ E\Xn-Xn\. 

The first part is controlled thanks to the strong order Theorem of [3j: for any 0<r<l/2, we have 
for any < n < uq 

The second part is deterministic and is controlled thanks to Proposition 15.13] 

\Xn - X{nAt)\ < ^ + C(l + \x\)At^-\ 

where C depends on T,r,x,y. 

It remains to focus on the third part e„ = Xn — Xn- Instead of analyzing the local error like in 
|12| . we adopt a global point of view, and we follow the idea of the proof of Theorem 1.1 in [3j: for 
any < n < no 

n— 1 n— 1 

(6.3) Xn-Xn = Sl^x + At ^ Sl[^Fk - Sltx - At ^ SlfF(Xk). 

k=0 k=0 
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The averaged coefficient F is Lipschitz continuous, and \S/^t\c{H) ^ 1; moreover we can define the 
averaged coefficient with respect to the invariant measure /i^'"^ of tlie fast numerical scheme - 
which is unique since we assume strict dissipativity (jSPp : for any x G H 

(6.4) F^{x) = [ 

The error in (j6.3p can then be decomposed in the fohowing way - the idea of looking at the square 
of the norm in the second expression is an essential tool of the proof: 

n-l 



(6-5) + (mtY.Sl^'Fk - 

V fc=0 / 

n-l 

+ AtJ2nSl-'^F{Xk)-Sl-''F^{Xk)\. 



k=0 



If we can control the two last terms by a certain quantity Q, by a discrete Gronwall Lemma we get 
for any < A; < no E|X„ - X„| < e'^^Q. 

First, the third term in (I6.5P is linked to the distance between the invariant measures fi^ and 
^x,T _ gjj^gg assume strict dissipativity for this strong estimate - which is evaluated thanks to 



Theorem 15.111 and Corollarv 15. 121 for test functions of class C^. Since \ {—A)^S'^^ \c{H) 
using Assumption 13. 5| we can apply Corollary 15.121 to obtain 



n—k\ ^ C 



{{n-k)At)'^ ■ 



and summing we get 

n-l 



k=0 



The other term is more complicated; in order to get a precise estimate, we expand the square of 
the norm of the sum. We can then use some conditional expectations, which allow to use exponential 
convergence to equilibrium via Theorem 15.91 Therefore we obtain the following expansion: 



n—l n—1 



mtJ^SM^h - Sl-''T{Xk)\^ = At^^E\Sl-HFk-F'{Xk))\^ 

k=0 k=0 



0<ki<k2<n-l 

(i) We first treat Si. 

By (12. 5p , for any 0<A;<n— Iwe have 



^ M riT+N-l 



j=l m=nT 
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therefore we can see that 

nsi,\Fk-T{Xk))\'' 

jl ,32 m\ ,m2 

with the conditional expectation with respect to Qk - see (j6.ip . 

When jl 7^ j2, Cm'^ ^^^^ Cm^^ ^^re independent, so that we treat differently the cases ji = j2 and 
jl 7^ j2 in the above summation, and we obtain 

M^N^E\Sl-''iFk-F^{Xk))\^ 

M 

+ E E ^ < Sl-i'{F{Xk,Yk,„^,J)-F^{Xk)),Sl-;HF{Xk,Yk,„,,,J)-F^{Xk)) > • 

j=l mi,m2 

For the first part, we can directly use the exponential convergence to equilibrium result of 
to have a bound with 



1 J"" 1 p-cnyr _ —c{N+nJ')^ 
(_ \^ -cmT\2 ^ (J_^ ^ 



'N ^ ' -^N 1 - e 

m=nT 

1 _ p-cA^T 

< iCe-^''^^ ■ 

~ ^ Nt ■ 



n cnrr 
< ( f. 

For the second part, we see that we can treat only the case mi < m2, and we introduce the 

conditional expectation E^^mij with respect to the c-field generated by Qk and iCkmo+m)o<m<mi-i, 
when mi < m2- We get a bound with 

g-c(m2-mi)r ^ ^ 



MAf2 - m{Nt + 1) 



We therefore get 
(6.6) 



('m^ We now consider S2, which corresponds to the cross-terms in the expansion of the square of 
the norm of the quantity {F^ — F^ {Xk)) ■ By the definition of F^, the general term with indices 
ki < k2 in IS2I is bounded by 



|E < Sl-'^{F,,-TiX,,)),Sl-'-iF,,-riX,,)) > 



At^ — T — r 



At2 ^ 



using conditional expectation E^j and the boundedness of F. 
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Using the exponential convergence result of (|5.8p and Lemma [5.81 we get the bound 
so that the previous quantity is bounded by 

MN ^ ^ - Nt + 1 

m=nj' j=\ 

Summing on ki < k2, we can now conclude that 

p-criTT 

(6.7) E2 < C- 



Nt + V 
then by and (16^1) 

E|At ^ 5^7^F. - SX^'^r < C(—-— + Ati——f + 



.Nt + 1 + M{Nt + 1] 



and the result of Theorem 14.11 now follows from (j6.5p . 

7. Proof of the weak convergence Theorem 14.21 

In order to get a better bound for the weak error than for the strong error, we use an auxiliary 
function which is solution of a Kolmogorov equation. 

The proof is valid under the general dissipativity Assumption 13.91 

We divide the proof in two parts: the first one contains the elements of the proof, while the 
second one is devoted to two technical lemmas. 



7.1. Proof of the Theorem. According to the decomposition (14. ip . we want to control for any 
< n < no 

\E^{X'{nAt)) -E$(X„)| < \E^{X'{nAt)) -E$(X(nAt))| 

(7.1) +|$(X(nAt))-$(X„)| 

+ |$(X„)-E^.(X„)|. 

Thanks to the averaging Theorem of [3] , which is proved under the weak dissipativity assumption 
(jWPp . the first term above can be controlled by Cre^~^ , where Cr is a constant - depending on 
r, <I>, X, y, T, for any < r < 1. 

For the second term, since we look at the error made by using a deterministic scheme to approxi- 
mate a deterministic equation, there is no difference between the strong and the weak orders - since 
the test function $ is Lipschitz continuous; so we again use Proposition 15. 13l 

For the third term, we see that we have to control some error between two different numerical 
schemes, in a weak sense. The usual strategy is to decompose this error by means of an auxiliary 
function satisfying some kind of Kolmogorov equation. 

More precisely, we use the deterministic scheme defining X^ in order to define for any < A; < n 

(7.2) Un{k,x) = ^(Xn-k{x)), 

where we explicitly mention dependence of the numerical solution X^ on the initial condition x. 
Remark 7.1. We can easily prove that we have 

Un{n,x) = ^{x) 
Un{k, x) = u{k + 1, Si^tx + AtS^tF{x)) for any k < n. 
This is the way this function is defined in [12j. 

20 



We now analyse the error by identifying a telescoping sum: 

|$(X„)-E$(X„)| = |u„(0,x) -Etx„(n,X„)| 

n-l 

= \y^{Eun{k,Xk)-Eun{k + l,Xk+i))\ 
(7.3) t^o 

<Y,\^Un{k,Xk) -EUn{k + l,Xk+l)\. 
k=0 

According to Lemma 17.21 below. n„ is of class C^, and we can control the first and second order 
derivatives: 

Lemma 7.2. For any < T < +oo, there exists Ct > such that for any < n < no = [^^J and 
< k < n, we have, for any x £ H, h €z H , /ii, /i2 G H: 

\DxUn{k,x).h\ < Ct|/i|, 

\Dl^Un{k,x).{hi,h2)\ < Ct|/ii||/i2|- 

Moreover for any < k < n — 1 and any x €z H, h €z H , 

\f^\{-A)-v 



\D^Unik,x).h\ < Cr(At|/i| + 



((n - k)At)T 
where r] is defined in Assumption\!J.4 



Since the auxiliary functions u„ are linked to the deterministic discrete-time process (Xn)n>Oi 
the proof does not use stochastic tools. 

Moreover the second and the third estimates of this Lemma reveal some smoothing effect in the 
equation, due to the semi-group {S'^^)n£n- These results are specific to the infinite dimensional 
framework; if we only use the first estimate above, and a simple one like 

\Dl^Un{k,x).{hi,h2)\ < CT\hi\\h2\(-A)v, 

we would not obtain an optimal convergence result. 

To make the proof of Theorem 14.21 clearer, we postpone the proof of Lemma 17.21 in Section 17.21 
In the general term of (17. 3p . we proceed with a Taylor expansion and we get 

\Eunik, Xk)-Eun{k + l,Xk+l)\ 

= \Eun{k + 1, SAtXk + AtSAtF{Xk)) - Eun{k + 1, S^Xk + AtSAtFk)\ 

^^■^^ < At\ED.,Un{k + 1, SAtXk + AtSAtF{Xk)).{SAtF{Xk) - SAtFk)\ 

+ CAt^E\Fk-F{Xk)\''. 

The second term is controlled by using the second estimate of the previous Lemma 17.21 Under the 
assumption that F is bounded, we therefore get a O(At^) term; when summing over < k < n — 1, 
we get a 0(At) term, which is already dominated in the final estimate. 
When k = n — 1, since Un{n, .) = ^, 

(7.5) |En„(n- -En„(n,X„)| < CAt. 

In the rest of the proof, we focus on the general case < A; < ?i — 1. 

For the first term, we do not exactly follow the proof of |12j : we rather define auxiliary functions 
for < A; < n in order to keep on looking at a weak error term: 

(7.6) ^n(k,x,y) = D^Un{k,SAtx + AtSAtF{x)).{SAtF{x,y)). 
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Then if we define ^ni^ + 2;) = ^nik + 1, x, y)fi^{dy) we have 
\ED,Un{k + 1, SAtXk + AtSAtF{Xk)).{SAtF{Xk) - SAtFk)\ 

riT+N-l M 
m=nj- j=l 

By using conditional expectation with respect to defined by (I6.ip . 

does not depend on j, so that in the sequel we fix j G {1, ... , M}. 

The following Lemma gives the regularity results for the auxiliary functions: 

Lemma 7.3. For any < T < +00 and Ato > 0, there exists a constant C , such that for any 
< At < Ato, CLny 1 < n < no = [^J and any < A; < n — 1, the following derivatives exist and 
are controlled: for any x,y €z H , h,k ^ H , 

\Dy'^n{k,x,y).h\ < C\h\H, 
\Dl^nik,x,y).h\ < C{1 + 

Like in Lemma [7.21 the proof relies smoothing effect of the semi-group (S^J^gN. The Lemma is 
proved below in Section [7.21 

We can then apply Theorem 15.111 valid with the weak dissipativity condition (jWPp , using the 
conditional expectation E^: for any ut < m < niQ and k < n — 1 

\E^n{k + l,Xk,Yk,mj) - E'^nik + 1, < Ce"^™^ + C{1 + ^\ J r^/'"". 

Therefore 

\ED^Un{k + 1, SAtXk + AtSAtF{Xk)).{SAtF{Xk) - SAtFk)\ 

^ riT+N-l AI 

m=nT j=l 
-criTT 1/2-K 

< C^^ + Ct^/^"" + C- 



Nt + 1 ((nT-l)T)i/2-'^- 
To conclude, it remains to use (|7.3p and (|7.ip . 

Remark 7.4. When the strict dissipativity Assumption is satisfied, we can indeed obtain a bound 

without 77 , ,-^,-1/, ^ .■ we can control the distance between the invariant measures of the continuous 

and discrete time processes, thanks to the second part of Corollarv \5.1^ 

7.2. Proof of the auxiliary Lemmas 17.21 and 17.31 

Proof of Lemma 17.21 We use the following expression for 

n-l 

X„(x) = SltX + AtY,Sl-^'F(XUx)). 

k=0 

By definition, for any < A; < n we have Un{k,x) = <I>(X„_fc(x)); we see that the derivatives in 
directions h,hi,h2 £ H are given by 

D^Un{k,x).h = D^(Xn^k{x)).{-^Xn-k{x).h), 

dx 
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and 

— (f — 

Dl^Un{k,x).{hi,h2) = D<^{Xn-k{x)).{-j-^Xn-k{x).{hi,h2)) 



+ D^^Xn-k{x)).{^Xn.k{x).hi,-^Xn-k{x).h2). 
ax ax 

$ is of class on H, with bounded derivatives; therefore we just need to control and 

■^Xn-kix).{hi,h2)- We use the following estimates of the derivatives of F, given in Proposition 
15.51 for any x €z H , h £ H , hi, h2H, 



\DF{x).h\ < C\h\ 
\{-ArW^F{x).{hi,h2)\ < C\h\\h2\. 
(i) For any < n < no, we can write 



T n-l , 

^Xn{x).h = Sl,h + MY,Sli''DF{Xk{x)){—Xj,{x).h)- 

k=0 



therefore 

\^Xn{x).h\ < \h\+C/\tY,\—Xk{x).h\, 

k=0 



and a discrete Gronwall Lemma then yields 



\^Xn{x).h\ < \h\e^^^' <\h\e^^. 
ax 

(ii) For any < n < uq, we can write 

k=0 

J2 _ 

fc=0 

Since \S'^''{—Ay'\ < (^(^n-k)At)v "^^en k < n, and thanks to the previous estimates on ■^Xk{x).h 
and Z^^F, we get 

Therefore 

\^Xn{x).{hiM)\ < C\hi\\h2\ 

n— 1 ^2 

fc=0 

and a discrete Gronwall Lemma then yields 

\^Xn{x).{hi,h2)\ < C\hi\\h2\. 

dx^ 
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( in ) To prove the last estimate of the Lemma, we write 

k=0 

c "-'^ d — 



We obtain 



{nAt)'^\^Xn{x).h\ < C\h\^_Ay, + CAt{nAty\h\H 



dx 

n-l 



+ C{nAtrAtY,—{JAt) 



To conclude, we use Gronwall Lemma, to get 

d 

dx 



inAt)^\^Xn{x).h\ < CTm{-A)-i+^t{nAtr). 



□ 



Proof of Lemma 17.31 

(i) The first derivative with respect to y is easy to control: we have for any h £ H 

Dy^n{k, X, y).h = DxUn{k, Smx + AtS/\tF{x)).{S/\tDyF{x, y).h), 

and we get {Dy'^nik, x,y).h\ < C\h\H- 

(a) When we look at the second-order derivative, we see that 

Dly^n{k, x, y).{h, k) = D^unik, Saix + AtSAtF{x)).{SAtDlyF{x, y).{h, k)). 

Thanks to the last estimate of Lemma I7.2| we can control the expression by 

nfA.iQ n2 ^\SAtDl F{x,y).{h,k)\(^_A)-A 
C \At\SAtDyyF{x,y).{hMH + ((n - k)At)^ J ' 

We then notice that 

At\SAtDlyF{x,y).{hMH < CAt^-^\{-A)-^DlyF{x,y).{hM 

<C\h\H\k\H, 

since r/ < 1 and At is bounded, and thanks to Assumption 13.5} the other part is controlled thanks 
to Assumption 13.51 □ 
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